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HAMILTONIAN SYSTEMS: STABILITY AND INSTABILITY THEORY 



PATRICK BERNARD 



The solar system has long appeared to astronomers and mathematicians as a model of sta- 



OO 

f~^ I bility. On the other hand, statistical mechanics relies on the assumption that large assemblies 

CN ' of particles form highly unstable systems (at the microscopic scale). Yet all these physi- 

cal situations are described, at least to a certain degree of approximation, by Hamiltonian 
systems. 

One may hope that Hamiltonian systems can be classified in two different categories, stable 
and unstable ones. However, the situation is much more complicated and both stable and 
unstable behaviors cohabit in typical systems. Even our examples are not perfect paradigms 
(^ . of stability and instability. Indeed, it is now clear from numerical as well as theoretical points 

Q I of views that some instability is present over long time-scales in the solar systems, so that 

for example future collisions between planets can not be completely ruled out in view of 
our present understanding. On the other hand, unexpected patterns of stability have been 
discovered in systems involving a large number of particles. 

Understanding the impact of stable and unstable effects in Hamiltonian systems has been 
considered since Poincare as one of the most important questions in dynamical systems. In 
the present text, we will discuss model Hamiltonian systems of the form 



Heiq,p) = h{p) +eGeiq,p) 

where {q,p) G T x [/, with U a bounded open subset of M". Recall that the equations of 
motion are 

(1) q{t)=dph{p) + edpG,{q,p) 

(2) p{t) = -ed,G,{q,p). 



The textbook [T] is a good general introduction on Hamiltonian systems. We will always 
C^ I denote by uj{p) the frequency map dph{p), which plays a crucial role. Here, as is obvious 

in (l2|), the action variables p are preserved under the evolution in the unperturbed case 
e = 0. We will try to explain what is known on the evolution of these action variables 
for the perturbed system. As we will see, in many situations, these variables are extremely 
stable. For example, KAM theorem implies that, for a positive measure of initial conditions 
(gO)Po) the trajectory {q{t),p{t)) satisfies \\p{t) — p(0)|| ^ Ce for all times. Examples show 
that some initial conditions may lead to unstable trajectories, that is trajectories such that 
\\p{t) — p{0)\\ ^ 1/C for some t (depending on e) and some fixed constant C independent of 
e. However, this is, as we will see, possible only for very large time t (meaning that t as a 
function of e has to go to infinity very quickly when e — > 0). The main questions here are 
to understand in what situation instability is or is not possible, and what kind of evolutions 
can have the actions variable p. Another important question is to estimate the speed (as a 
function of the parameter e) of the evolutions of p. 
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0.1. A convention. We assume, unless otherwise stated, that the Hamiltonians are real 
analytic. The norm \H\ of the Hamiltonian H is the uniform norm of its holomorphic extension 
to a certain complex strip. We do not specify the width of this strip. Whenever we consider 
a family iJ^, F^. . . of Hamiltonians, we mean that the norm \H^\ is bounded when e — > 0. 

1. Averaging and exponential stability. 

The first observation concerning the action variable is that they should evolve at a speed 
of the order of e. However, averaging effects occur. More precisely, in the equation p{t) = 
—edgHf.{q{t),p{t)), the variable q{t) is moving fast compared to p{t). If the evolution of q{t) 
nicely fills the torus T", it is tempting to think that the averaged equation 

m = -eVMt)) 
should approximate accurately the actual behavior of p(t), where 

V,{p) := [ d,H{q,p)dq. 

We have V = 0, which leads to think that the evolution should consist mainly of oscillations 
of small amplitude with no large evolution. This reasoning is limited by the presence of 
resonances. 

1.1. Frequencies. A frequency cj G M°' is said resonant if there exists k £ Z^(= ^^ — {0}) 
such that {kjLo) = 0. The resonance module of u>, 

Z{uj) = {k£Z'^/{k,Lo) =0} 

is a subgroup Z, , we denote by R{uj) the vector space generated by Z{lo) in M . The order of 
resonance r{uj) is the dimension of R{lo). The main examples of a resonances of order r are the 
frequencies uj = (tt;i,0) where u}\ € M is non resonant. This example is universal. Indeed, 
if a; is a resonant frequency, then there exists a matrix A £ GldC^) such that Au = (tJi,0), 
where wi € M is not resonant. The matrix A can be seen as a diffeomorphism of T , which 
transports the constant vector- field uj to the constant vector-field Auj = (0^1,0). It is useful 
to distinguish, among non resonant frequencies, some which are sufficiently non-resonant. A 
frequency u; S M is called Diophantine if there exists real constants 7 > and t ^ d such 
that 

\{u;,k)\^^\\k\\'-^ 

for each k E Z^. Finally, a frequency is called resonant-Diophantine if there exists a matrix 
A G Glni'Z) such that lo = A{loi, 0), where loi £ M"^^ is a Diophantine frequency. 

1.2. Symplectic diffeomorphisms and Normal Forms. An efficient mathematical method 
to take averaging effects into account is the use of normal forms. Normal form theory consists 
in finding new coordinates in which the fast angles have been eliminated from the equations 
up to a small remainder. This is done exploiting the existence of a large group of diffeomor- 
phisms preserving the Hamiltonian structure of equations, called symplectic diffeomorphisms 
or canonical transformations. We refer the reader to standard textbooks for these notions, 
for example to [Ij. An important point is that a symplectic diffeomorphism (p sends the tra- 
jectories of the Hamiltonian H o 4> to the trajectories of the Hamiltonian H. A Hamiltonian 
N{q,p) is said in i?-normal form, where -R is a linear subspace of M", if dqN G R for each 
{q,p). Let us give an illustrative result, taken from [8]. Note that this result is not sufficient to 
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obtain uniform stability estimates, as in Nekhoroshev Theorem below. More precise normal 
form results are given in [9j and |llj . 

1.3. Normal form theorem. LetojQ = u;(po) be a given Diophantine or resonant-Diophantine 
frequency. Let us denote Br{po) the open ball of radius r in M!^ centered at po. There exists a 
constant a which depends only on tu, and constants eg > and C > such that the following 
holds: For cache < eg, there exists an analytic symplectic embedding (f)^ : f xB^u\ — > T xU , 
which is e-close to identity and such that 

He o 0e(g,p) = h{p) + ^Ne{q,p) + n{e)Fe{q,p), 

where N is in R{loo) -normal form, r(e) ^ ^/e, and fj,{e) ^ e~'^'^ ". 

This means that the motions with resonant initial conditions are confined, up to small 
oscillations, in the associated affine plane p{0) + R{uj{p{0)) until they live the domain of the 
normal form, or until time fi~^{e). 

1.4. Geometry of resonances. In view of the Normal Form Theorem, we are led to consider 
the curves P{9) : M — > M'^ which satisfy 

Pie')-p{e)GR{u;{p{e))) 

for each 9 and 9' . Indeed, it appears that these curves are the ones the action variables can 
follow on time-scales not involving the remainders of the normal forms. Note that here the 
parameter 9 is not the physical time. Assuming that P{9) is such a curve, we can define the 
affine space 

R:=P{0)+ne^RRiu;{P{9))). 

We then have P{9) G R for each 9. In addition, each point P{9),9 G M is a critical point 
of the restriction /ii^ of the unperturbed Hamiltonian h to the affine space R. It follows 
that the curve P{9) has to be constant if the unperturbed Hamiltonian satisfies the following 
hypothesis. 

1.5. Nekhoroshev steepness. We say that the unperturbed Hamiltonian h is steep if, for 
each affine subspace A inW^, the restriction /lu has only isolated critical points. 

This formulation, due to L. Niederman, [lOj . is much simpler than the equivalent one first 
given by Nekhoroshev in [9]. It turns out that this condition, which was made natural by our 
heuristic explanation, implies stability over exponential time-scales for all initial conditions, 
see [9]. We first need another condition. 

1.6. Kolmogorov non-degeneracy. We say that the unperturbed Hamiltonian h is non- 
degenerate in the sense of Kolmogorov if it has non- degenerate Hessian at each point, or 
equivalently if the frequency map p > — > ^(p) is an immersion. 

1.7. Nekhoroshev stability theorem. Assume that the unperturbed Hamiltonian does not 
have critical points (uj{p) does not vanish), satisfies Nekhoroshev steepness and Kolmogorov 
non- degeneracy conditions. Then there exists constants a > and 6 > 0, which depend only 
on h, and constants eo > and C > such that the following holds: For e < eq, each 
trajectory {qe{t)^pe{t)) satisfies the estimate 

\\Pe{t)-pM\\^Ce'' 

for all t such that \t\ ^ e*^^ ". 
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1.8. Herman's example. In oder to illustrate the necessity of the condition of steepness, 
let us consider the Haniiltonian 

He{qi,q2,Pi,P2) =PiP2 + (^V{qi). 

with V : T — > M. The associated equations are 

P2 = 0, pi = -V', qi=P2, q2=Pi- 

The trajectories whose initial condition are subjected to P2(0) = and V'{qi{0)) ^ satisfy 

pl{t)=pi{0)-teV'{qim, P2(t)=0, gi(t)=gi(0). 

We see an evolution at speed e of the action variable pi contradicting the conclusion of 
Nekhoroshev theorem. In this example, we have R{uj{p{t))) = M x {0}, and /iiKxjo} = Oi ^o 
that the curve 

Pie) = (9,0) 

is indeed a curve of critical points of /i|kx{o}- 

1.9. Genericity of steepness. The condition of steepness is frequently satisfied. In order 
to be more precise, we mention that, for N £ N large enough (how large depends on the 
dimension d), steepness is a generic condition in the finite dimensional space of polynomials 
of degree less that A^. Note in contrast that a quadratic Hamiltonian is steep if and only if 
it is positive definite. Finally, it is important to mention that convex Hamiltonians h with 
positive definite Hessian are steep. More generally, quasi-convex Hamiltonians are steep. A 
function h : U — > M is said quasi-convex if, at each point, the restriction of its Hessian to 
the kernel of its differential is positive definite. 

1.10. The quasi-convex case. It is interesting to be more precise about the values of a and 
b in Nekhoroshev Theorem. We shall do so in the quasi-convex case, which is the most stable 
case, and where much more is known. If h is quasi-convex, one can take 

a = b = — -, 
2d 

as was proved by P. Lochak, see [7j . It is a question of active present research whether these 

exponents are optimal. It now appears that this is almost so, and that the optimal exponent 

a should not be larger than l/2(d — 3). That this exponent deteriorates as the dimension 

increases is of course very natural in the perspective of statistical mechanics. As a matter of 

fact, not only the exponent a, but also the threshold eo of validity of Nekhoroshev Theorem 

deteriorates with the dimension, as was noticed in [5]. 

Another important fact was proved in [7j: in these expressions, the important value of d 

is not the total number of degrees of freedom, but the number of active degrees of freedom. 

More precisely, resonant initial condition are more stable than generic ones. If r is the order of 

resonance of a given initial condition, then the number d — r of fast angles can be substituted 

to the total number of degrees of freedom for the computation of the stability exponent. This 

phenomenon may account for the surprising stability obtained numerically by Fermi, Pasta 

and Ulam. 

2. Permanent stability. 

Many initial conditions satisfy more than exponential stability: they are permanently sta- 
ble. 
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2.1. Kolmogorov Theorem. Assume that h satisfies Kolmogorov non- degeneracy condition 
li.6l Then for each open subset V C M such that V <Z U , there exists eo > such that, for 

each e < eo there exists 

• a smooth symplectic embedding (f)^ iT'^xV — > T'^ x U, which is e-close to the identity, 

• a compact subset F^ of V, whose relative measure in V is converging to 1 as e — > 0, 

such that the Hamiltonian system H^ o (p^ preserves the torus T'^ x {p} for each p G F^. 
The union 

of all the invariant tori has positive measure. Its complement is usually an open dense subset 
of T"^ X U. All the orbits starting in this invariant set obviously undergo oscillations of 
amplitude of the order of e for all times. It is worth mentioning that some energy surfaces 
may not intersect the invariant set J-^- This is illustrated in example II. 8 ^ where the surface of 
zero energy does not contain invariant tori. The following condition guaranties the existence 
of invariant tori on each energy surface. 

2.2. Arnold non-degeneracy. The Hamiltonian h is said to be non- degenerate in the sense 
of Arnold if it does not have critical points and if the map 

is a local diffeomorphism between each level set of h and S"^^^ . This is equivalent to say that 
the function {X,p) G M x [/ i — > Xh{p) has non- degenerate Hessian at each point of the form 

ihp). 

2.3. Arnold Theorem. If h satisfies Arnold non- degeneracy condition, then the relative 
measure of the set T^ of invariant tori is converging to 1 in each energy surface. 

This theorem prevents ergodicity of the perturbed systems for the canonical invariant mea- 
sure on its energy surface. This may be considered as a very disappointing result for statistical 
mechanics, whose mathematical foundation has often be considered to be the Boltzmann hy- 
pothesis of ergodicity. However, statistical mechanics is first of all a question of letting d go 
to infinity, and ergodicity might not be such a crucial hypothesis, see [6]. 

When d = 2, The theorem of Arnold has particularly strong consequences. Indeed, in this 
case, the invariant tori cut the energy surfaces in small connected components. The motion 
is then confined in these connected components. As a consequence, we obtain permanent 
stability for all initial conditions. 

In higher dimension however, the complement of J^^ in each energy shell is usually a dense, 
connected open set. There may exist orbits wandering in this large connected set, although 
the speed of evolution of these orbits is limited by Nekhoroshev theory. Understanding the 
dynamics in this open set is a very important and hard question. It is the subject of the next 
section. 

2.4. Relaxed assumption. For many applications, such as celestial mechanics, the non- 
degeneracy conditions of Arnold or Kolmogorov are not satisfied, or hard to check. However, 
the existence of invariant tori has been proved under much milder assumptions. As a rule. 
Invariant tori exist in the perturbed systems if the frequency map p i — > ^(p) stably contains 
Diophantine vectors in its image. 
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3. The Mechanism of Arnold 

Understanding instability is the subject of intense present research. General methods of 
construction of interesting orbits, as well as clever classes of examples are being developed. 
These methods are exploring the limits of stability theory. Here we shall only describe the 
foundational ideas of Arnold, see [Ij, where most of the present activity finds its roots. Al- 
though these ideas have some ambition of universality, they are best presented, like in [1], on 
an example. We consider the quasi-convex Hamiltonian 

Hiqi,q2,q3,Pi,P2,P3) = (pI + pI)/2 -p3 + e cos 2Trq2 + n{cos2iTq2){cos2iTqi +cos27rq'3). 

As we have seen, this system is typical of the kind of Hamiltonians one gets after reduction 
to resonant normal form. However, it is illuminating to consider // not as a function of e but 
as an independent parameter. This is an idea of Poincare then followed by Arnold. We shall 
expose the main steps of the proof of the following result. 

3.1. Theorem. Let us fix numbers < A < B. For each e > 0, there exists a number fio{^) 
such that, when < fi < fio{e) , there exists a trajectory 

(.qi(.t),q2(.t),pi{t),p2{t)) 

and a time T > (which depends on e and fi) such that 

piiO) ^ A , pi{T) ^ B. 



3.2. The truncated system. Let us begin with some remarks about the truncated Hamil- 
tonian obtained when fi = 0: 

Ho{q,p) = Hi{qi,qs,pi,p3) + H2iq2,P2) = pl/2 - p3 + pl/2 + ecos2iTq2. 

This system is the uncoupled product of Hi and of the pendulum described by H2. The 
variable pi is constant along motion, hence the Theorem can not hold for fi = 0. 

Recall that the point (72 = 0,p2 = is a hyperbolic fixed point of the pendulum H2{q2-,P2) = 
p'2/2 + ecos27rq'2- The stable and unstable manifolds of this integrable system coincide, they 
from the energy level H2 = e. As a consequence, in the product system of Hamiltonian 
Hq = Hi + H2, there exists, in the zero energy level, a one parameter family T^^ of invariant 
tori of dimension 2 

7L = {Pi =uj,P3= o?l2 + e,q2 = 0,P2 = 0} C T^ x R^. 

Each of these tori is hyperbolic in the sense that it has a stable manifold of dimension 3 and 
an unstable manifold of dimension 3, which are nothing but the liftings of the stable and 
unstable manifolds of the hyperbolic fixed point of H2- Notice that these manifolds do not 
intersect transversally along T^. 

When /u 7^ 0, the perturbation is chosen in such a way that the tori T^^ are left invariant 
by the Hamiltonian flow. 



HAMILTONIAN SYSTEMS: STABILITY AND INSTABILITY THEORY 7 

3.3. Splitting. For < fj, < fj,Q{e) , the invariant tori T^ still have stable and unstable mani- 
folds of dimension 3. These stable and unstable manifolds intersect transversally in the energy 
surface, along an orbit which is homoclinic to the torus. 

The first point is tliat tlie tori remain liyperbolic, and that the stable and unstable manifolds 
are deformed, but not destroyed by the additional term. This results from the observation 
that the manifold M formed by the union of the invariant tori is normally hyperbolic in its 
energy surface. Note that this step does not require exponential smallness of //. 

It is then a very general result that the stable and unstable manifolds have non-empty 
intersection. It is a global property, which can be established by variational methods, and 
which still does not rest on exponential smallness of //. 

The key point, where exponential smallness is required, is transversality. Since transver- 
sality is a generic phenomenon, one may think that this step is not so crucial. And indeed, 
it is very likely that the statement remains true for most values of /x G]0, e] (and not only 
for jji, ^ ^^{e)). However, there are two important issues here. First, transversality is hard 
to establish on explicit examples. Second, it is useful for many further discussions to obtain 
some quantitative estimates. 

Indeed, we can associate to the intersection between the stable and unstable manifolds 
a quantity, the splitting, which in a sense measures transversality. Discussions on such a 
definition are available in [8]. Using methods of Poincare and Melnikov, Arnold showed that 
this splitting can be estimated, for sufficiently small e, by 

(3) a•^^le~^'^' + 0{^?). 

This implies non-nullity of the splitting, hence transversality, for small /i. 

3.4. Transition chain. We have established the existence, when /i > is small enough, of 
a family T^^ of hyperbolic invariant tori such that the stable manifold W^^ and the unstable 
manifold W~ intersect transversally along a homoclinic orbit (but not along T^^\) for each w. 

A stability argument shows that the stable manifold W^ of the torus T^ intersects transver- 
sally the stable manifold W~^ of the torus T^^g when oj is close enough to ujq. How close directly 
depends on the size of the splitting. We obtain heteroclinic orbits between tori close to each 
other. 

Given two values uo and u)' , we can find a sequence LOi,l ^ i ^ N such that loq = lo, 
LOjq = (jj' , and W~ intersects transversally W^-^ for all i. The associated family T^. of tori is 
called a transition chain. 

The left step consists in proving that some orbits shadow the transition chain. Arnold 
solved this step by a very simple topological argument which, however, does not provide any 
estimate on the time T. He proves the existence of an orbit joining any neighborhood of T^ 
to any neighborhood of T^^i . This ends the proof of the main theorem, since we can chose uj 
and cj' such that to < A < B < uj' . 

The dynamics associated to hyperbolic tori and transition chains have later been studied 
more carefully. It particular, a A-lemma can be proved in this context, which allows to 
conclude that, in a transition chain, the unstable manifold Wq of the first torus intersects 
transversally the stable manifold of the last torus W^^. These detailed study also allow to 
relate the speed of diffusion to the splitting of the invariant manifolds. 

3.5. Diffusion speed. It is interesting to estimate the speed of evolution of the variable pi, 
or in other words the time T in the statement. It follows from Nekhoroshev theory that this 
time T has to be exponentially large as a function of e. In fact, it is possible to prove, either 
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by recent developments on the ideas of Arnold exposed above, or more easily by variational 
methods, [4], that 

gC/v^ 



T ^ 



for /x ^ Aio(e)- This time is of course highly related to the estimate ([3]) of the splitting. In 
addition, Ugo Bessi proved that one can take /io(£) = e~^''^. Plugging this value of /U in 
the estimate of T, we get the estimate T ^ e'~^'^ as a function of the only parameter e. 
Considering the fact that the orbit we have described go close to double resonances, this is 
the best estimate one may hope for in view of the improved Nekhoroshev stability estimates 
at resonances. 

The idea is now well spread that the time of diffusion is exponentially large. However, we 
point out that, if it is indeed exponentially small as a function of the parameter e, it is only 
polynomially small as a function of the second parameter ^, as was first understood by P. 
Lochak and proved in [3] using the variational method of U. Bessi. 

3.6. Conclusion. The theories of instability are developing in several directions. One of 

them is to try to understand the limits of stability, and to test to what extent the stability 
results obtained so far are optimal. This aspect has quickly developed recently, for example 
the optimal stability exponent a for convex systems is almost known. Another direction is to 
try to give a description of unstable orbits in typical systems. This remains a widely open 
question. 

Let us finally mention that the application of the theories we have presented to concrete 
systems is very difficult. One of the reasons is that the estimates of the threshold eo of 
validity of Nekhoroshev and KAM theorems that can (painfully) be obtained by inspection 
in the proofs are very bad. Much too bad, for example, to think about applications to the 
solar systems with the physical values of the parameters. 
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